PASS: Program Structure Aware Stratified Sampling for Statistically Selecting Instruction Traces and Simulation Points

نویسندگان

  • Sreekumar V. Kodakara
  • Jinpyo Kim
  • Wei-chung Hsu
  • David J. Lilja
  • Wei-Chung Hsu
چکیده

As modeled microarchitectures become more complex and the size of benchmark program keeps increasing, simulating a complete program with various input sets is practically infeasible within a given time and computation resource budget. A common approach is to simulate only a subset of representative parts of the program selected from the complete program execution. SimPoint [1,2] and SMARTS [10] have shown that accurate performance estimates can be achieved with a relatively small number of instructions. This paper proposes a novel method called Program structure Aware Stratified Sampling (PASS) for further reducing microarchitecture simulation time without losing accuracy and coverage. PASS has four major phases, consisting of building Extended Calling Context Tree (ECCT), dynamic code region analysis, program behavior profiling, and stratified sampling. ECCT is constructed to represent program calling context and repetitive behavior via dynamic instrumentation. Dynamic code region analysis identifies code regions with similar program phase behaviors. Program behavior profiling stores statistical information of program behaviors such as number of executed instructions, branch mispredictions, and cache miss associated with each code region. Based on the variability of each phase, we adaptively sample instances of instruction streams through stratified sampling. We applied PASS on 12 SPEC CPU2000 benchmark and input combinations and achieved average 1.46 % IPC error bound from measurements of native execution on Itanium-2 machine with much smaller sampled instruction streams.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Power Aware Encoding for the Instruction Address Buses Using Program Constructs

This paper examines the address traces produced by various program constructs. By using the correlation induced by the program constructs within these traces, we develop a scalable bus encoding algorithm to significantly reduce the switching activity on the instruction address bus. Simulation results for Spec2000 benchmarks show that for modest coding complexity, the proposed scheme reduces swi...

متن کامل

An Evaluation of Stratified Sampling of Microarchitecture Simulations

Recent research advocates applying sampling to accelerate microarchitecture simulation. Simple random sampling offers accurate performance estimates (with a high quantifiable confidence) by taking a large number (e.g., 10,000) of short performance measurements over the full length of a benchmark. Simple random sampling does not exploit the often repetitive behaviors of benchmarks, collecting ma...

متن کامل

Automatically Selecting Representative Traces for Simulation Based on Cluster Analysis of Instruction Address Hashes

Simulation of standard benchmarks is the most important method for computer architects to evaluate architectural enhancement. However, modern benchmarks usually take prohibitively long time to run in cycle accurate simulator. In this paper we propose a simple method that is based on clustering analysis of instruction address hashes to identify representative simulation intervals. Preliminary re...

متن کامل

Accelerating Architectural Simulation by Parallel Execution of Trace Samples

In order to quickly decide which architectural features are to be included in future processors, we have developed a simulation approach that uses samples of benchmark program instruction traces. Rather than simulating a proposed architecture on the entire SPEC92 program suite of more than 100 billion instructions, we simulate using a set of samples of the SPEC92 suite containing less than 1% o...

متن کامل

A Novel Structure for Realization of a Pseudo Two Path Band-Pass Filter

In this paper, a modified auto zeroed integrator is used to design and simulate a low-voltage high-Q switched capacitor pseudo 2-path filter.  The filter is a sixth–order Chebyshev band-pass filter operating at sampling frequency of 1MHz and center frequency of 250 kHz with a quality factor of 50. The proposed filter has both low-voltage and high speed properties of the auto zeroed integrators ...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2005